Doppelgänger Finder: Taking Stylometry to the Underground

نویسندگان

  • Sadia Afroz
  • Aylin Caliskan
  • Ariel Stolerman
  • Rachel Greenstadt
  • Damon McCoy
چکیده

Stylometry is a method for identifying anonymous authors of anonymous texts by analyzing their writing style. While stylometric methods have produced impressive results in previous experiments, we wanted to explore their performance on a challenging dataset of particular interest to the security research community. Analysis of underground forums can provide key information about who controls a given bot network or sells a service, and the size and scope of the cybercrime underworld. Previous analyses have been accomplished primarily through analysis of limited structured metadata and painstaking manual analysis. However, the key challenge is to automate this process, since this labor intensive manual approach clearly does not scale. We consider two scenarios. The first involves text written by an unknown cybercriminal and a set of potential suspects. This is standard, supervised stylometry problem made more difficult by multilingual forums that mix l33t-speak conversations with data dumps. In the second scenario, you want to feed a forum into an analysis engine and have it output possible doppelgängers, or users with multiple accounts. While other researchers have explored this problem, we propose a method that produces good results on actual separate accounts, as opposed to data sets created by artificially splitting authors into multiple identities. For scenario 1, we achieve 77% to 84% accuracy on private messages. For scenario 2, we achieve 94% recall with 90% precision on blogs and 85.18% precision with 82.14% recall for underground forum users. We demonstrate the utility of our approach with a case study that includes applying our technique to the Carders forum and manual analysis to validate the results, enabling the discovery of previously undetected doppelgänger accounts.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Comparison of 3D Registration Algorithms for Autonomous Underground Mining Vehicles

The ICP algorithm and its derivatives is the de facto standard for registration of 3D range-finder scans today. This paper presents a quantitative comparison between ICP and 3D NDT, a novel approach based on the normal distributions transform. The new method addresses two of the main problems of ICP: the fact that it does not make use of the local surface shape and the computationally demanding...

متن کامل

Sustainable development in Urban Underground Space

During a very long period of time, civil engineers have been the only ones to be designated as the experts for underground space, while the planners and architects were the ones of the development at the surface. Cities worldwide tend to overlook an invaluable asset that lies beneath their surfaces. Most cities and urban regions are unaware of the benefits underground space use has to offer, bo...

متن کامل

Doppelgänger: a solenoid-based large scale sound installation

This paper presents the sound art installation Doppelgänger. In Doppelgänger, we combine an artistic concept on a large scale with a high degree of control over timbre and dynamics. This puts great demands on the technical aspects of the work. The installation consists of seven 3.5 meters-tall objects weighing a total of 1500 kilos. Doppelgänger transfers one soundscape into another using audio...

متن کامل

MATHEMATICAL MODELLING FOR DICE FINDER GAME PROBLEM

Play is often episodic and mission-centric, with a series of challenges culminating in a final puzzle or enemy that must be overcome. Multiple missions played with the same characters may be related to each other in a plot arc of escalating challenges. The exact tone, structure, pace and end (if any) vary from game to game depending on the needs and preferences of the players, as in [9]. "THE C...

متن کامل

Unified Pulsed Laser Range Finder and Velocimeter using Ultra-Fast Time-To-Digital Converter

In this paper, we present a high accuracy laser range finder and velocimeter using ultra-fast time-to-digital converter (TDC). The system operation is based on the measuring the round-trip time of a narrow laser pulse. A low-dark current high-speed PIN photodiode is used to detect the triggered laser beam and to produce start signal. The pulsed laser diode generates 45W optical power at 30ns du...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014